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1. Statement of results 
The set of integer solutions (xi, X2, X3) to the equation 

(1.1) xl + xl + xl = n 

has been much studied. However it appears that the spatial distribution 
of these solutions at small and critical scales as n — )• 00 have not been 
addressed. The main results announced below give strong evidence to the 
thesis that the solutions behave randomly. This is in sharp contrast to what 
happens with sums of two or four or more squares. 

First we clarify what we mean by random. For a homogeneous space like 
the A;-dimensional sphere with its rotation-invariant probability measure 
a, the binomial process is what you get by placing N points Pi, ... , Pn on 
independently according to a. We are in interested in statistics, that 
is functions /(-Pi, . . . ,Pn), which have a given behaviour almost surely, as 
A'^ — )• 00. If this happens we say that this behaviour of / is that of random 
points. We shall also contrast features of random points sets with those of 
"rigid" configurations, by which we mean points on a planar lattice, such as 
the honeycomb lattice. 

A celebrated result of Legendre/Gauss asserts that n is a sum of three 
squares if and only if n 7^ 4"(8& + 7). Let £{n) be the set of solutions 

(1.2) £{n) = {x e : |xp = n} 
and set 

(1.3) N = Nn:= #£{n) 

The behaviour of is very subtle and it was a fine achievement in the 
1930's when it was shown that Nn goes to infinity with n (assuming say 
that n is square- free; if n = 4° then there are only six solutions). It is 
known that Nn <C n^/'^~^°^^^ and if there are primitive lattice points, that 
is X = (xi,a;2,a;3) with gcd(a;i, X2, X3) = 1 (which happens if an only if 
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n 7^ 0,4,7 mod 8) then there is a lower bound of Nn ^ n^/'^^'^W . This 
lower bound is ineffective and indicates that the behaviour of Nn is still far 
from being understood [17] . 

The starting point of our investigation is the fundamental result conjec- 
tured by Linnik (and proved by him assuming the Generalized Riemann 
Hypothesis), that for n 7^ 0, 4, 7 mod 8, the points 

(1.4) 8{n) := -^£{n) C 5^ 

obtained by projecting to the unit sphere, become equidistributed on the 
unit sphere with respect to a as n — )• 00. This was proved unconditionally 
by Duke [U [S] and Golubeva and Fomenko [8j , following a breakthrough by 
Iwaniec [11] . Random points are equidistributed by definition and the above 
result says that on this crudest global scale the projected lattice points £{n) 
behave like random points. Figure [1] gives some visual support for random 
behaviour of S{n). 
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Figure 1. Lattice points coming from the prime n = 
1299709 (center), versus random points (left) and rigid points 
(right). The plot displays an area containing about 120 
points. 



To make this precise we examine various statistics associated with the 
placement of points in S"^. Our choice of these statistics is based on ro- 
bustness tests for the random hypothesis, as well as quantities which are 
of interest in number theoretical and harmonic analysis applications. Our 
philosophy in what follows is that the behaviour of a quantity in question is 
easy to determine for random points while for £{n) we settle for estimates 
for them and also formulate conjectures, which are more precise. That one 
has to settle for such information for this kind of problem is to be expected 
given the problematic non-random behaviour of the number A'^^ itself. 

1.1. Electrostatic energy. The electrostatic energy of points Pi, ... , P/v 
on S"^ is given by 

(1-5) P(Pi,...,Pjv) ■■=Y. \plp.\ 
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E-N{N - 1) 




integer random 


1224 


-282. 95. 


3072 


37732. -4704. 


4296 


8380. 1747. 



Table 1. The difference E — N{N — 1) between tlie elec- 
trostatic energy and its expected value, for various values of 
A^. In the column labeled "integer", the energy for £{n) was 
computed for the primes n = 104773, 104761 and 1299763 
with Nn listed in the left-most column. In the random case 
the result is a mean value of 20 runs. 



This energy E depends on both the global distribution of the points as well 
as a moderate penalty for putting the points too close to each other. The 
minimum energy configuration is known to satisfy |2H [22l [1] 

(1.6) iV2-/3iV3/2< niin E{Pi, . . . ,Pn) < N'^ - aN^'"^ 

Pi,...,Pn 

for some < a < /3 < oo. The configurations which achieve this are rigid 
in various senses [3j and we will see below in Corollary 11.51 that our points 
£{n) are far from being rigid. For random points one ha^ that E ~ N'^ but 
that E — N{N — 1) has no definite sign. Our first result is that to leading 
order the points £{n) have the same energy as the above. 

Theorem 1.1. There is some 5 > so that 

(1.7) E{S{n)) = N^ + OiN^-^) 
as n ^ oo, n 7^ 0, 4, 7 mod 8. 

We have not been able to say anything about the sign of E{£{n)) — N{N — 
1) which according to Table [LT] appears to vary. 

1.2. Point pair statistics. The point pair statistic and its variants is at 
the heart of our investigation. It is a robust statistic as for as testing the 
randomness hypothesis and it is called Ripley's function in the statistics 
literature ^8\. For Pi, ... Pat G S'^ and < r < 2, set 

(1.8) Kr{Pi,...,PN)-= 1 

\P^-PJ\<r 



Here and elsewhere, ~ is the usual asymptotic symbol denoting convergence to one of 
the ratio of the two sides. 
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to be the number of ordered pairs of distinct points at (Euclidean) distance 
at most r apart. For fixed e > 0, uniformly for A^~^^'^ < r <2, one has that 
for random points (the binomial process) 

(1.9) Kr{Pi,...,PN) ^\n{N 

Based on the results below as well as some numerical experimentation, 
we conjecture that for n square-free the points £{n) behave randomly w.r.t. 
Ripley's statistic at scales A^""^"*""^ < t < 2; that is 

(1.10) /Cr('?(n)) ~ — - — , as n — ^ oo . 

One of our main results is the following which shows that (jl.lOp is true 
at least in terms of an upper bound which is off only by a multiplicative 
constant. 

Theorem 1.2. Assume the Generalized Riemann Hypothesis (GRH). Then 
for fixed e > and N~^^'' <r<2, 

kr{£{n)) 

where the implied constant depends only on e. 

Remark: We do not need the full force of GRH here, but rather that there 
are no "Siegel zeros". 

We have not succeeded in giving individual lower bounds for Kr{£{n)), 
what we can show is that at the smallest scale (jl.lOp holds for most n's. 



Theorem 1.3. There is some 5q > such that for fixed < 6 < 6o and 



Kr{S{n)) 
for almost all n. 

1.3. Nearest neighbour statistics. Closely connected to K is the distri- 
bution of nearest neighbour distances dj, i. e. the distance from Pj to the 
remaining points. Area considerations show that — 16. It is more 

convenient to work with these squares of the distances. In order to space 
these numbers at a scale for which they have a limiting distribution in the 
random case, we rescale them by their mean for the random case, i.e. replace 
d^j by x^l' Thus for Pi, . . . ^Pn £ S"^ define the nearest neighbour spacing 
measure /x(Pi, . . . , Pn) on [0, oo) by 

1 ^ 

(1.11) ^(p^,...,p^):=_^5^^, 

i=i 

where (5g is a delta mass at G M. Note that the mean of /x is at most 1 and 
that for random points we have 

(1.12) /Li(Pi, . . . , Pat) ^ e'^'dx, as N ^ oo 
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Based on this and numerical experiments (see figure [2]) we conjecture: 
Conjecture 1.4. As n — )• oo along square-free integers, n ^ 7 mod 8, 
(1.13) f^{£{n)) e-'^dx . 

minimal distances for (10^+l)-th prime 

} 
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Figure 2. A histogram of the scaled minimal spacing be- 
tween lattice points for for n = 179424691, the 10,000,001- 
th prime, where Nn = 94536, and modulo symmetries there 
are 1970 points. The smooth curve is the exponential distri- 
bution e~^. 

As a Corollary to Theorem 11.21 we have 

Corollary 1.5. Assume GRH. If v is a weak limit of the ii{£{n)) then v is 
absolutely continuous, in fact there is an absolute constant C4 > such that 

(1.14) u < adx 

Corollarv 11.51 implies that the if(n)'s are not rigid for large n since for 
rigid configurations, IJi-p-^,...,Pj^ — >• ^-j^i^/y^- ^l^o in as much as it ensures that 
such a V cannot charge {0} positively, it follows that almost all the points 
of £{n) are essentially separated with balls of radius approximately N^^/"^ 
from the rest. Precisely given a sequence rf^ satisfying r/jv = o{N~'^/'^), all 
but o{N) of the points in £{n) have the ball of radius r/7v about them free 
of any other points. 

1.4. Minimum spacing and covering radius. Given Pi,...,P/v G S'^ 
define the minimum spacing to be the 

(1.15) m{Pi, . . . , P/v) := mindjj = mindj 
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This statistic is very sensitive to the placement of points and it is of arith- 
metic interest for £{n). From the area packing bound we have that 

(1.16) m(Pi,...,PAr) <4/ViV 

for any configuration. In fact the rigid configuration of Figure [T] maximizes 
m asymptoticahy 



(1.17) max m{Pi,...,PN) ^-2 



Pu-,Pn VN \ V12 

For random points the behaviour of the minimal spacing m is very different 
(1.18) m(Pi,...,P^)=iV-i+°(i). 

Based on the random point model as well as number theoretic considera- 
tions which involve a nonlinear and shifted variation of Vinogradov's least 
quadratic residue conjecture [20], we pose 



Conjecture 1.6. m(£{n)) = N ^+°(^) as n ^ oo. 

The lower bound in Conjecture 11.61 is an immediate consequence of the 
integrality of the points in £{n) since that implies that for the projected 
points Pi Pj £ £{n), we have \Pi — Pj\ > l/\/n and since N > n^/'^+°('^) 
the lower bound follows. It is the upper bound that appears difficult even 
assuming GRH. 

As with the previous statistics we can establish the conjecture for almost 
all n. Indeed it follows from Theorem 11.31 that 

Corollary 1.7. Given e> 0, M{£{n)) iV~^+^ for almost all n. 

Note that Conjecture 11.61 would follow from the stronger conjecture of 
Linnik |13] . that for e > and n odd and square- free (and n ^ 7 mod 8) 
there are xi,X2,X3 with |x3| < n'^ and xf -|- + X3 = n. 

Finally we examine the covering radius for £{n) though there is little of 
substance that we can prove. Given Pi,...,Pn £ S'^ the covering radius 
M{Pi, . . . , P/v) is the least r > so that every point of S'^ is within distance 
at most r of some Pj. Again an area covering argument shows that for any 
configuration M(Pi, . . . , P/v) > 

As a statistic, the covering radius M is much more forgiving than the 
minimal spacing m in that the placement of a few bad points does not affect 
M drastically. In particular for random points, M < iV~i/2+o(i). 

Based on this we conjecture the following, though admittedly it is based 
on much less evidence than the previous conjectures. 

Conjecture 1.8. M{£{n)) = Ar-i/2+o(i) as 00. 

An effectivization of the equidistribution of £{n) [SI [5] which is needed in 
the proof of Theorem 11.11 yields an a > such that M{£(n)) ^ A^^°. 
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1.5. Higher dimensions. The distribution of the solutions to 

(1.19) xf+xl^ hxj=n 

for t 7^ 3 is very different and certainly non-random. Firstly for t = 2 and say 
n a prime, n = 1 mod 4, there are exactly eight solutions to (jl.l9p . So there 
is little to say about the distribution for individual such n's. However for 
"generic" n's which are sums of two squares, the projections of the solutions 
to the unit circle are uniformly distributed [12117], and for such n's the local 
statistical questions certainly make sense. 

For t > 4, the projections onto the unit sphere of the solutions to (jl.l9p 
can be examined using the same techniques that we use for t = 3, with the 
main differences being that the analysis is easier and the local behaviour 
is no longer random. We only discuss the last feature and since it is only 
enhanced with increasing t, we stick to t = 4. Let <?4(n) be the set of 
solutions to ()1.19p and let <?4(n) be the projection of this set to S^, the 
unit sphere in M'^. The first difference to t = 3 is that := ^£4(11) is 
a regularly behaved function of n. When divided by 8 it is multiplicative 
and for n = p an odd prime ^£4^{p) = 8{p + 1). Thus the number of points 
being placed on satisfies 

(1.20) N = 

at least for odd n. For N random points on the two point function Kr 
defined as in (fLSl) satisfies that for e > and iV^^/a+e < ^ < 2 

(1.21) Kr.{Pi,...,PN)^ N{N-l)Vir) 

where V{r) is the relative volume of a cap {x G : |x — xo| < r}; for small 
r, V{r) ~ ^r^. 

On the other hand for £4(71), the integrality of the corresponding points 
in £4{n), implies that for x ^ y, \x — y\ > l/\/n and hence for a; / y G £4{n) 

(1.22) |x - y| > iV-^/2+°W 
In particular 

(1.23) Kr{£4{n)) =0, r < iY^^/^-e 

Thus at the scales N^'^^'^^'' < r < N^'^/'^~'', the point pair function for £4{n) 
and that for random points are very different. 

This difference is also reflected in the minimum spacing function m{£4^{n)) 
for points on S^. From (fT^ we have the lower bound m{£4{n)) > 
N~^/'^~^°^^^ and on the other hand there is a similar upper bound, namely 

Proposition 1.9. 



(1.24) 



m{£4{n)) = iV-i/2+o(i) 
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This is in sharp contrast to random points on for which 

(1.25) m(Pi,...,P/v) = iV-2/3+°(i) 

Thus the points <?4(ra) are much more rigid than random points but they 
are far from being fully rigid as the latter satisfy (locally these points are 
placed at the vertices of the face centered cubic lattice [2\): 

2 7r2/3 

(1.26) ^max^m4(Pi,...,PAr) ~ ^;^c, c = 

The nonrandom behaviour of the points £4(12) manifests itself at a much 
larger scale as well, as is demonstrated by the minimum covering radius 
M4(Pi, . . . , P/v). While being very nonrigid, random points cover 53 quite 
well. For them we have 

(1.27) M4(Pi, . ..,Pn) = Ar-V3+o(i) ^ 

Somewhat surprisingly the points £4(71) which are more rigid than random 
points, are poorly distributed in terms of covering. This phenomenon of 
what might be called "big holes" was first observed in the context of ap- 
proximations of 2 X 2 real matrices by certain rational ones, by Harman [9j. 
For £4(11) we have 

Proposition 1.10. 

(1.28) M{£4(n)) > iV-i/4+o(i) _ 



2. Outline of the proofs 

For n squarefree the general mass formula of Minkowski and Siegel, which 
in the following special case is due to Gauss, expresses Nn in terms of 
L{l,Xd„) where Xd„ is the quadratic character associated to the field Q(\/— n) 
of discriminant d„. From this and Siegel's lower bound on L(l,Xd) it fol- 
lows that Nn ^ n^/'^~'' for any e > (ineffectively). The key tool in our 
analysis of the local point-pair functions is the mass formula applied to the 
representations of the binary form nu^ + 2tuv + nv^ by the ternary form 
x\ + X2 + x\ = {x,x). Since this ternary form has one class in its genus 
the above, which counts the number A{n,t) of pairs (x,y) G £in) x £{n) 
with {x, y) = t, is given by a product of local densities. Again this is a 
special case of the mass formula, for which an elementary proof as well as 
an explicit form was given in [19j, and this was a critical ingredient in Lin- 
nik's approach to the equidistribution of £{n.) (see [6] for a recent exposition 
and extension of his method). The local to global formula allows us to give 
rather sharp upper bounds for A{n,t). These are then used to control the 
contributions of nearby points in the sum (11.5P in the course of proving The- 
orem 11.11 For pairs of points that are not too close we use modular forms 
and in particular Duke's theorem. Specifically we effectivise that analysis 
by giving a power saving (namely A^~", for some q > 0) upper bound for 
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the spherical cap discrepancy of the points £{n). Putting these two together 
leads to Theorem 11.11 

The proof of Theorem 1 1 . 2 1 also uses the local formula for A(n, t), this time 
giving upper bounds for this quantity when summed over t in short intervals. 
It is critical that these upper bounds are sharp up to a universal factor and 
depend only on the subtle function A^^ and not on n. We achieve this by 
adapting the upper bound sieve method of Nair [15j to our setting. This 
leads to an upper bound in terms of a product of local densities of primes 
connected with Xdn- It is here that we need to assume that there are no 
Siegel zeros in order to ensure that there is no dependence on n. 

The almost all result in Theorem 11.31 is proven by computing the as- 
ymptotic mean and variance of Kr{£{n)) — N^r'^ /A, with n < R. This is 
approached by analyzing similar asymptotics for 

(2.1) K,,{£{n))= 1 

x—y=h 

and 

(2.2) Kh,k{£in))= Yl 1' 

x—y=h, z—w=k 

The behaviour as ii — oo of J2n<R^h{£{n)) may be determined ele- 
mentarily, while that of J2h<R^h,k{£{n)) can be derived using Klooster- 
man's circle method for quadratic forms in 4 variables (see for example 
[H], [TO]). The leading terms are given as products of Hardy-Littlewood lo- 
cal densities. The behaviour of Ylin<R-^r{£{n))Nn and "^nKR^n be 
determined using the Besicovich r-almost periodic properties of Nnl\pn [16] . 
We rederive this almost periodicity directly using the circle method and this 
allows us to compare the various local densities directly. 

The proof of Proposition 11.91 is immediate from Legendre and Gauss' 
Theorem. Namely n — = x\ + X2 + x\ has a solution for a = 1 or a = 2 
(recall n is odd) . Proposition 11.101 follows by considering annuli about the 
north pole (1,0,0,0). 
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